Abstract: We consider general multi-antenna fading channels 
with coarsely quantized outputs, where the channel is unknown 
to the transmitter and receiver. This analysis is of interest in 
the context of sensor network communication where low power 
and low cost are key requirements (e.g. standard IEEE 802.15.4 
applications). This is also motivated by highly energy constrained 
communications devices where sampling the signal may be more 
energy consuming than processing or transmitting it. Therefore 
the analog-to-digital converters (ADCs) for such applications 
should be low-resolution, in order to reduce their cost and power 
consumption. In this paper, we consider the extreme case of 
only 1-bit ADC for each receive signal component. We derive 
asymptotics of the mutual information up to the second order in 
the signal-to-noise ratio (SNR) under average and peak power 
constraints and study the impact of quantization. We show that 
up to second order in SNR, the mutual information of a system 
with two-level (sign) output signals incorporates only a power 
penalty factor of almost $\pi/2$ (1.96 dB) compared to the system 
with infinite resolution for all channels of practical interest. This 
generalizes a recent result for the coherent case [1]. 

I. Introduction 

Several contributions studied MIMO channels operating in 
Rayleigh fading environments, especially in the low SNR [2], 
[3], [4], [5], [6] and high SNR regimes. Unfortunately, 
most of these contributions assume that the receiver has 
access to the channel data with infinite precision. In practice, 
however, a quantizer (analog-to-digital converter) is applied to 
the receive signal, so that the channel measurements can be 
processed in the digital domain. 

The relative simplicity of realizing and integrating functions 
digitally makes it desirable to move the analog-to-digital interface 
further towards the antenna. However, both the resolution 
and the speed required of the ADC then tend to rise, making such converters 
expensive, power intensive and even infeasible, especially in 
ultra-wideband and/or high speed applications. In fact, in order 
to reduce circuit complexity and save power and area, low 
resolution ADCs have to be employed [9]. 

In [1], [10], we studied the effects of quantization from 
an information theoretical point of view for MIMO systems, 
where the channel is perfectly known at the receiver. It turns 
out that the loss in channel capacity due to coarse quantization 
is surprisingly small at low to moderate SNR. In [11], the 
block fading SISO noncoherent channel was studied in detail. 
Motivated by these works, we aim to study the communication 
performance of general noncoherent fading channels taking 
into account the coarse quantization. We consider the extreme 
case of a 1-bit quantized (hard-decision) MIMO channel with 
no CSI at the transmitter and the receiver. When a single-bit 
hard-limiter is used, the implementation of the all-digital receiver 
is considerably simplified [12], [13], [14]. In particular, 
automatic gain control (AGC), linearity requirements of RF 
components and multipliers for signal correlation are no longer 
necessary. 

Our paper is organized as follows. Section II describes the 
system model and notational issues. In Section III, we give 
the general expression of the mutual information between the 
inputs and the quantized outputs of the MIMO system, then 
we expand it into a Taylor series up to the second order 
of the SNR in Section IV. Finally, in Section V, we utilize 
these results to elaborate on the asymptotic capacity of 1-bit 
multiple receive antenna (SIMO) channels with delay spread 
and receive correlation in a Rayleigh flat-fading environment. 

II. System Model And Notation 

We consider a point-to-point quantized MIMO channel 
with $M$ transmit dimensions (e.g. antennas) and $N$ receive 
dimensions. Fig. 1 shows the general form 
of a quantized MIMO system, where $\mathbf{H} \in \mathbb{C}^{N \times M}$ is a 
random channel matrix. The entries of this channel matrix 
are jointly complex circular Gaussian with zero mean and 
possible mutual correlations. The channel realizations are 
unknown to both the transmitter and receiver. The vector 
$\mathbf{x} \in \mathbb{C}^{M}$ comprises the $M$ transmitted symbols, assumed to be 
subject to an average power constraint and a peak power 
constraint. The vector $\boldsymbol{\eta}$ represents the additive noise, whose 
entries are i.i.d. and distributed as $\mathcal{CN}(0, 1)$. $\mathbf{r} \in \mathbb{C}^{N}$ is the 
unquantized channel output 

$$ \mathbf{r} = \sqrt{\rho}\, \mathbf{H} \mathbf{x} + \boldsymbol{\eta}, \tag{1} $$

where $\rho$ represents the signal-to-noise ratio. In our system, 
the real parts $r_{i,R}$ and the imaginary parts $r_{i,I}$ of the receive 
signals $r_i$, $1 \le i \le N$, are each quantized by a 1-bit resolution 
quantizer. Thus, the resulting quantized signals read as 

$$ y_{i,c} = \mathrm{sign}(r_{i,c}) \in \{-1, 1\}, \quad \text{for } c \in \{R, I\},\ 1 \le i \le N. \tag{2} $$

Throughout our paper, $a_i$ denotes the $i$-th element of the 
vector $\mathbf{a}$, and $[\mathbf{a}]_{i,c} = a_{i,c}$ with $c \in \{R, I\}$ is the real or 
imaginary part of $a_i$. The operators $(\cdot)^H$ and $\mathrm{tr}(\cdot)$ stand for 
Hermitian transpose and trace of a matrix, respectively. Vectors 
and matrices are denoted by lower and upper case italic bold 
letters, respectively. 
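As an illustration of the model (1) and the sign quantizer (2), the following minimal sketch draws one quantized output vector. It is only a sketch under stated assumptions: the channel entries are taken i.i.d. $\mathcal{CN}(0,1)$ (the paper allows correlated entries), a zero input to the sign function is mapped to $+1$, and the function names are hypothetical.

```python
import numpy as np

def one_bit_quantize(r):
    """Apply sign() separately to the real and imaginary parts of r."""
    re = np.where(r.real >= 0, 1.0, -1.0)  # assumption: sign(0) := +1
    im = np.where(r.imag >= 0, 1.0, -1.0)
    return re + 1j * im

def simulate_output(H, x, rho, rng):
    """Draw y = sign(sqrt(rho) H x + eta) with eta ~ CN(0, I)."""
    N = H.shape[0]
    eta = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2)
    r = np.sqrt(rho) * (H @ x) + eta
    return one_bit_quantize(r)

rng = np.random.default_rng(0)
N, M = 4, 2
# Assumption for this example only: i.i.d. CN(0, 1) channel entries.
H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
x = np.array([1.0 + 0j, -1.0 + 0j])
y = simulate_output(H, x, rho=0.1, rng=rng)
```

Each entry of `y` lies in $\{\pm 1 \pm j\}$, so the receiver observes $2N$ bits per channel use.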
Fig. 1. One-bit quantized MIMO system.



III. Mutual Information 

The mutual information (in nats/s/Hz) between the channel 
input and the quantized output in Fig. Q] reads as lfT31l 



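$$ I(\mathbf{x}, \mathbf{y}) = H(\mathbf{y}) - H(\mathbf{y}|\mathbf{x}) = \mathrm{E}_{\mathbf{x}}\left[ \sum_{\mathbf{y}} P(\mathbf{y}|\mathbf{x}) \ln \frac{P(\mathbf{y}|\mathbf{x})}{P(\mathbf{y})} \right], \tag{3} $$

with $P(\mathbf{y}) = \mathrm{E}_{\mathbf{x}}[P(\mathbf{y}|\mathbf{x})]$, where $\mathrm{E}_{\mathbf{x}}[\cdot]$ is the expectation 
taken with respect to $\mathbf{x}$. Here, $H(\cdot)$ and $H(\cdot|\cdot)$ 
represent the entropy and the conditional entropy, 
respectively. Given the input $\mathbf{x}$, the unquantized output 
$\mathbf{r}$ is zero-mean complex Gaussian with covariance 
$\mathrm{E}[\mathbf{r}\mathbf{r}^H|\mathbf{x}] = \mathbf{I}_N + \rho\, \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]$, and thus we 
have 

$$ p(\mathbf{r}|\mathbf{x}) = \frac{\exp\left( -\mathbf{r}^H \left( \mathbf{I}_N + \rho\, \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right)^{-1} \mathbf{r} \right)}{\pi^N \det\left( \mathbf{I}_N + \rho\, \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right)}. \tag{4} $$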



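Afterwards, we can express the conditional probability of the 
quantized output as 

$$ P(\mathbf{y}|\mathbf{x}) = \int_0^\infty \!\!\cdots\! \int_0^\infty p(\mathbf{y} \circ \mathbf{r}|\mathbf{x})\, d\mathbf{r} = \int_0^\infty \!\!\cdots\! \int_0^\infty \frac{\exp\left( -(\mathbf{y} \circ \mathbf{r})^H \left( \mathbf{I}_N + \rho\, \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right)^{-1} (\mathbf{y} \circ \mathbf{r}) \right)}{\pi^N \det\left( \mathbf{I}_N + \rho\, \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right)}\, d\mathbf{r}, \tag{5} $$

where the integration is performed over the positive orthant of 
the complex hyperplane and $\mathbf{y} \circ \mathbf{r}$ denotes a dimension-wise 
vector product with $[\mathbf{y} \circ \mathbf{r}]_{i,c} = y_{i,c}\, r_{i,c}$. 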
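The evaluation of this multiple integral is intractable in general. 
Thus, let us first consider a simple lower bound involving the 
mutual information under perfect channel state information at 
the receiver, studied in [1]. By the chain rule and the 
non-negativity of the mutual information, 

$$ I(\mathbf{x}, \mathbf{y}) = I(\mathbf{H}, \mathbf{y}) + I(\mathbf{x}, \mathbf{y}|\mathbf{H}) - I(\mathbf{H}, \mathbf{y}|\mathbf{x}) \ge I(\mathbf{x}, \mathbf{y}|\mathbf{H}) - I(\mathbf{H}, \mathbf{y}|\mathbf{x}) = \mathrm{E}_{\mathbf{x},\mathbf{H}}\left[ \sum_{\mathbf{y}} P(\mathbf{y}|\mathbf{x},\mathbf{H}) \ln \frac{\mathrm{E}_{\mathbf{H}}[P(\mathbf{y}|\mathbf{x},\mathbf{H})]}{\mathrm{E}_{\mathbf{x}}[P(\mathbf{y}|\mathbf{x},\mathbf{H})]} \right], \tag{6} $$

where we can express each of the conditional probabilities 
$P(\mathbf{y}|\mathbf{x},\mathbf{H})$ as the product of the conditional probabilities on 
each receiver dimension, since all of the real and imaginary 
components of the receiver noise $\boldsymbol{\eta}$ are statistically independent 
with power $\frac{1}{2}$: 

$$ P(\mathbf{y}|\mathbf{x},\mathbf{H}) = \prod_{c \in \{R,I\}} \prod_{i=1}^{N} P(y_{i,c}|\mathbf{x},\mathbf{H}) = \prod_{c \in \{R,I\}} \prod_{i=1}^{N} \Phi\left( y_{i,c} \left[ \sqrt{2\rho}\, \mathbf{H}\mathbf{x} \right]_{i,c} \right), \tag{7} $$

with $\Phi(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{x} e^{-t^2/2}\, dt$ the cumulative normal 
distribution function. Evaluating the lower bound in (6), even 
numerically, is very difficult, except for some simple cases, 
like SISO block faded channels [11]. Thus, in the following, 
we shall derive a quadratic approximation of the mutual 
information. 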
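To make the per-dimension factorization concrete, the sketch below evaluates the probability of a quantized output given the input and a fixed channel realization, and checks that it sums to one over all $4^N$ outputs. It assumes only what the model states (noise variance $1/2$ per real dimension, so each factor is $\Phi(y_{i,c}\sqrt{2\rho}\,[\mathbf{H}\mathbf{x}]_{i,c})$); the function names and the example channel are hypothetical.

```python
import itertools
import math
import numpy as np

def Phi(t):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def cond_prob(y, x, H, rho):
    """P(y | x, H) for 1-bit outputs; y has entries in {+-1 +- j}."""
    s = np.sqrt(2.0 * rho) * (H @ x)  # signal divided by the noise std sqrt(1/2)
    p = 1.0
    for yi, si in zip(y, s):
        p *= Phi(yi.real * si.real) * Phi(yi.imag * si.imag)
    return p

def all_outputs(N):
    """Enumerate all 4^N possible quantized output vectors."""
    symbols = [1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]
    return [np.array(t) for t in itertools.product(symbols, repeat=N)]

# Hypothetical fixed channel realization with N = 2, M = 1.
H = np.array([[0.3 + 0.2j], [-0.5 + 1.0j]])
x = np.array([1.0 + 0j])
total = sum(cond_prob(y, x, H, rho=0.5) for y in all_outputs(2))
```

At $\rho = 0$ every output has probability $4^{-N}$, which is the starting point of the Taylor expansion used in the next section.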

IV. Second-order Expansion of the Mutual 
Information 

In this section, we will elaborate on the second-order 
expansion of the input-output mutual information (3) of the 
considered system in Fig. 1 as the signal-to-noise ratio goes 
to zero. We state the main result and prove it at the end 
of the section. 

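Theorem 1: Consider the one-bit quantized MIMO system 
in Fig. 1 under an input distribution $p(\mathbf{x})$ satisfying 
$\mathrm{E}_{\mathbf{x}}[\|\mathbf{x}\|_4^{4+\epsilon}] < \delta$ for some finite constants $\epsilon, \delta > 0$. Then, 
to the second order, the mutual information (in nats) between 
the inputs and the quantized outputs is given by 

$$ I(\mathbf{x}, \mathbf{y}) = \frac{1}{2} \left( \frac{2}{\pi} \rho \right)^2 \mathrm{tr}\left\{ \mathrm{E}\left[ \left( \mathrm{nondiag}(\mathrm{E}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H|\mathbf{x}]) \right)^2 \right] - \left( \mathrm{nondiag}(\mathrm{E}[\mathbf{H}\,\mathrm{E}[\mathbf{x}\mathbf{x}^H]\,\mathbf{H}^H]) \right)^2 \right\} + \Delta I(\mathbf{x}, \mathbf{y}), \tag{8} $$

with $\Delta I(\mathbf{x}, \mathbf{y}) = o(\rho^2)$, where $\mathrm{nondiag}(\mathbf{A})$ is obtained from $\mathbf{A}$ by setting all its 
diagonal entries to zero, and $\|\mathbf{a}\|_4^4$ is the 4-norm of $\mathbf{a}$ taken 
to the power 4, defined as 

$$ \|\mathbf{a}\|_4^4 = \sum_{i,c} a_{i,c}^4. $$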
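As a worked instance of the theorem, the sketch below evaluates the trace term for the block-faded SISO example discussed in the comments that follow ($\mathbf{H} = h \cdot \mathbf{I}_3$, $h \sim \mathcal{CN}(0,1)$, unit-modulus QPSK), and compares it with the same trace term without the $\mathrm{nondiag}(\cdot)$ operator, which is the soft-output counterpart. It assumes the second-order expression has the form $\frac{1}{2}(\frac{2}{\pi}\rho)^2\,\mathrm{tr}\{\cdot\}$ for hard decisions and $\frac{1}{2}\rho^2\,\mathrm{tr}\{\cdot\}$ for soft outputs; the helper names are hypothetical.

```python
import itertools
import numpy as np

def nondiag(A):
    """Set all diagonal entries of A to zero."""
    return A - np.diag(np.diag(A))

def trace_term(inputs, probs, cov_given_x, mask):
    """tr{ E_x[mask(C(x))^2] - mask(E_x[C(x)])^2 }, C(x) = E_H[H x x^H H^H]."""
    mean_C = sum(p * cov_given_x(x) for x, p in zip(inputs, probs))
    first = sum(p * np.trace(mask(cov_given_x(x)) @ mask(cov_given_x(x)))
                for x, p in zip(inputs, probs))
    second = np.trace(mask(mean_C) @ mask(mean_C))
    return (first - second).real

# Uniform i.i.d. QPSK over a block of 3 symbols; H = h * I_3 with E|h|^2 = 1,
# so that E_H[H x x^H H^H] = x x^H.
qpsk = [np.exp(1j * (np.pi / 4 + k * np.pi / 2)) for k in range(4)]
inputs = [np.array(t) for t in itertools.product(qpsk, repeat=3)]
probs = [1.0 / len(inputs)] * len(inputs)
cov = lambda x: np.outer(x, x.conj())

rho = 0.05
t_hard = trace_term(inputs, probs, cov, nondiag)
t_soft = trace_term(inputs, probs, cov, lambda A: A)
I_hard = 0.5 * (2.0 * rho / np.pi) ** 2 * t_hard   # hard-decision approximation
I_soft = 0.5 * rho ** 2 * t_soft                   # soft-output approximation
# Both scale with rho^2, so the power penalty is the square root of the ratio.
penalty_db = 10 * np.log10(np.sqrt(I_soft / I_hard))
```

In this particular example the diagonal contributions cancel in the soft-output term as well, so both trace terms equal 6 and the penalty is exactly $\pi/2$, i.e. about 1.96 dB.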



A. Comments on Theorem 1 

As an example, Fig. 2 illustrates the lower bound (6) and the 
quadratic approximation (8) computed for a block faded SISO 
model with a coherence interval of 3 symbol periods ($\mathbf{H} = 
h \cdot \mathbf{I}_3$, $h \sim \mathcal{CN}(0, 1)$) under QPSK signaling. Note that the 
lower bound (6) is tight, since $I(\mathbf{H}, \mathbf{y}) = 0$ in this case [11]. 

A similar result was derived by Prelov and Verdu in Q for 
the soft output r 



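$$ I(\mathbf{x}, \mathbf{r}) = \frac{1}{2} \rho^2\, \mathrm{tr}\left\{ \mathrm{E}\left[ \left( \mathrm{E}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H|\mathbf{x}] \right)^2 \right] - \left( \mathrm{E}[\mathbf{H}\,\mathrm{E}[\mathbf{x}\mathbf{x}^H]\,\mathbf{H}^H] \right)^2 \right\} + o(\rho^2), \tag{9} $$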



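where we identify a power penalty of $\pi/2$ due to quantization 
and observe that the diagonal elements of $\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]$ do not 
contribute to the mutual information in the hard-decision 
system. This is because no mutual information 
can be extracted from the amplitude of each two-level received 
signal. This means that the channel coefficients have to be 
correlated; otherwise $\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]$ is diagonal and the 
mutual information is zero. Nevertheless, in most systems of 
practical interest, correlation between the channel coefficients, 
whether temporal or spatial, exists even in multipath-rich 
mobile environments. Thus $\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]$ is rather a dense 
matrix, whose Frobenius norm is dominated by the 
off-diagonal elements rather than the diagonal entries. Hence the 
low SNR penalty due to hard-decision is nearly 1.96 dB for 
almost all practical channels. 

Fig. 2. Mutual information of the mono-bit block Rayleigh-faded SISO channel with block length 3. 

B. Proof of Theorem 1 

As a first step, we compute the conditional entropy $H(\mathbf{y}|\mathbf{x})$ 
up to the second order of the SNR using a quadratic Taylor 
expansion of $P(\mathbf{y}|\mathbf{x})$ around $\rho = 0$: 

$$ P(\mathbf{y}|\mathbf{x}) \approx P(\mathbf{y}|\mathbf{x})|_{\rho=0} + \rho\, P'(\mathbf{y}|\mathbf{x})|_{\rho=0} + \frac{\rho^2}{2} P''(\mathbf{y}|\mathbf{x})|_{\rho=0}, \tag{10} $$

where $P'(\mathbf{y}|\mathbf{x})|_{\rho=0}$ and $P''(\mathbf{y}|\mathbf{x})|_{\rho=0}$ are the first and second 
derivatives of $P(\mathbf{y}|\mathbf{x})$ with respect to $\rho$, evaluated at $\rho = 0$. 
Then, its logarithm can be approximated as 

$$ \ln P(\mathbf{y}|\mathbf{x}) \approx \ln P(\mathbf{y}|\mathbf{x})|_{\rho=0} + \rho \frac{P'(\mathbf{y}|\mathbf{x})|_{\rho=0}}{P(\mathbf{y}|\mathbf{x})|_{\rho=0}} + \frac{\rho^2}{2} \left[ \frac{P''(\mathbf{y}|\mathbf{x})|_{\rho=0}}{P(\mathbf{y}|\mathbf{x})|_{\rho=0}} - \left( \frac{P'(\mathbf{y}|\mathbf{x})|_{\rho=0}}{P(\mathbf{y}|\mathbf{x})|_{\rho=0}} \right)^2 \right]. \tag{11} $$

Therefore, for a given input $\mathbf{x}$, 

$$ H(\mathbf{y}|\mathbf{x}) \approx -\sum_{\mathbf{y}} \left\{ P(\mathbf{y}|\mathbf{x})|_{\rho=0} \ln P(\mathbf{y}|\mathbf{x})|_{\rho=0} + \rho\, P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \left( 1 + \ln P(\mathbf{y}|\mathbf{x})|_{\rho=0} \right) + \frac{\rho^2}{2} \left[ P''(\mathbf{y}|\mathbf{x})|_{\rho=0} \left( 1 + \ln P(\mathbf{y}|\mathbf{x})|_{\rho=0} \right) + \frac{\left( P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \right)^2}{P(\mathbf{y}|\mathbf{x})|_{\rho=0}} \right] \right\}. \tag{12} $$

Clearly $P(\mathbf{y}|\mathbf{x})|_{\rho=0} = 1/4^N$, and since $P(\mathbf{y}|\mathbf{x})$ is a probability 
mass function, i.e. $\sum_{\mathbf{y}} P(\mathbf{y}|\mathbf{x}) = 1$, we have 

$$ \sum_{\mathbf{y}} P'(\mathbf{y}|\mathbf{x}) = \sum_{\mathbf{y}} P''(\mathbf{y}|\mathbf{x}) = 0. \tag{13} $$

Hence, averaging over the input, the conditional entropy has, to the second order, the 
following quadratic approximation: 

$$ H(\mathbf{y}|\mathbf{x}) \approx 2N \ln 2 - \frac{\rho^2}{2} 4^N \sum_{\mathbf{y}} \mathrm{E}_{\mathbf{x}}\left[ \left( P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \right)^2 \right]. \tag{14} $$

In a similar way, we can show that 

$$ H(\mathbf{y}) \approx 2N \ln 2 - \frac{\rho^2}{2} 4^N \sum_{\mathbf{y}} \left( \mathrm{E}_{\mathbf{x}}\left[ P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \right] \right)^2. \tag{15} $$

So, combining these results gives the quadratic expansion of 
the mutual information 

$$ I(\mathbf{x}, \mathbf{y}) = H(\mathbf{y}) - H(\mathbf{y}|\mathbf{x}) \approx \frac{\rho^2}{2} 4^N \sum_{\mathbf{y}} \left\{ \mathrm{E}_{\mathbf{x}}\left[ \left( P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \right)^2 \right] - \left( \mathrm{E}_{\mathbf{x}}\left[ P'(\mathbf{y}|\mathbf{x})|_{\rho=0} \right] \right)^2 \right\}. \tag{16} $$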

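The required first derivative is given by the following lemma. 

Lemma 1: The first derivative of the conditional probabilities 
in the hard-decision system with respect to $\rho$ is 

$$ P'(\mathbf{y}|\mathbf{x})|_{\rho=0} = \frac{1}{4^N} \frac{1}{\pi}\, \mathbf{y}^H \cdot \mathrm{nondiag}(\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]) \cdot \mathbf{y}. \tag{17} $$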



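Proof: Let us first expand the integral in (5) to the first 
order. To this end, we use the following approximation of 
$p(\mathbf{y} \circ \mathbf{r}|\mathbf{x})$$^1$: 

$$ p(\mathbf{y} \circ \mathbf{r}|\mathbf{x}) \approx \frac{\exp(-\|\mathbf{r}\|^2)}{\pi^N} \left[ 1 - \rho\, \mathrm{tr}(\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H]) + \rho\, \mathrm{tr}\left\{ (\mathbf{y} \circ \mathbf{r})(\mathbf{y} \circ \mathbf{r})^H \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right\} \right]. \tag{18} $$

Afterwards, it can be shown that 

$$ \int_0^\infty \!\!\cdots\! \int_0^\infty \frac{\exp(-\|\mathbf{r}\|^2)}{\pi^N} (y_i \circ r_i)(y_j \circ r_j)^*\, d\mathbf{r} = \frac{1}{4^N} \begin{cases} 1 & \text{for } i = j \\ \frac{1}{\pi}\, y_i y_j^* & \text{else.} \end{cases} \tag{19} $$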



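Thus, 

$$ \int_0^\infty \!\!\cdots\! \int_0^\infty \frac{\exp(-\|\mathbf{r}\|^2)}{\pi^N}\, \mathrm{tr}\left( (\mathbf{y} \circ \mathbf{r})(\mathbf{y} \circ \mathbf{r})^H \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right) d\mathbf{r} = \frac{1}{4^N} \mathrm{tr}\left\{ \left( \mathbf{I}_N + \frac{1}{\pi} \mathrm{nondiag}(\mathbf{y}\mathbf{y}^H) \right) \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right\} = \frac{1}{4^N} \mathrm{tr}\left\{ \mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H] \right\} + \frac{1}{4^N} \frac{1}{\pi}\, \mathbf{y}^H \mathrm{nondiag}(\mathrm{E}_{\mathbf{H}}[\mathbf{H}\mathbf{x}\mathbf{x}^H\mathbf{H}^H])\, \mathbf{y}, \tag{20} $$

since $\mathrm{tr}(\mathrm{nondiag}(\mathbf{A})\mathbf{B}) = \mathrm{tr}(\mathbf{A}\,\mathrm{nondiag}(\mathbf{B}))$ for any two 
square matrices $\mathbf{A}$ and $\mathbf{B}$. Integrating the first-order 
approximation in (18) over the positive orthant by making use of (20), 
and differentiating with respect to $\rho$, leads to the result of 
the lemma. ■ 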

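$^1$ The following identities are useful: $\frac{\partial}{\partial \rho} \det \mathbf{Q} = \det \mathbf{Q} \cdot \mathrm{tr}\left( \mathbf{Q}^{-1} \frac{\partial \mathbf{Q}}{\partial \rho} \right)$ and $\frac{\partial \mathbf{Q}^{-1}}{\partial \rho} = -\mathbf{Q}^{-1} \frac{\partial \mathbf{Q}}{\partial \rho} \mathbf{Q}^{-1}$. 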
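The off-diagonal case of the orthant integral used in the proof of Lemma 1 can be checked numerically. The sketch below is an illustration, not part of the original proof: it computes the single-dimension moment $\frac{1}{\pi}\int_0^\infty\!\int_0^\infty (y_R a + j y_I b)\, e^{-(a^2+b^2)}\, da\, db$ with a simple trapezoidal rule (grid size and truncation at 8 are arbitrary choices) and compares it with $y/(4\sqrt{\pi})$. For $i \neq j$ and $N = 2$ the integral factorizes, giving $y_i y_j^*/(16\pi) = \frac{1}{4^N}\frac{1}{\pi} y_i y_j^*$.

```python
import numpy as np

def trapezoid(f, h):
    """Composite trapezoidal rule for samples f on a uniform grid of step h."""
    return h * (np.sum(f) - 0.5 * (f[0] + f[-1]))

def orthant_mean(y, upper=8.0, points=4001):
    """(1/pi) * integral over a, b > 0 of (y_R a + j y_I b) exp(-(a^2 + b^2))."""
    grid = np.linspace(0.0, upper, points)
    h = grid[1] - grid[0]
    w = np.exp(-grid ** 2)
    m0 = trapezoid(w, h)          # approximates sqrt(pi) / 2
    m1 = trapezoid(grid * w, h)   # approximates 1 / 2
    return (y.real + 1j * y.imag) * m0 * m1 / np.pi

yi, yj = 1 + 1j, -1 + 1j
# For i != j the two complex dimensions are independent, so the cross moment
# over the positive orthant is the product of the single-dimension moments.
cross = orthant_mean(yi) * np.conj(orthant_mean(yj))
```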



Next, we plug the result of Lemma 1 into the required 
expression of the mutual information in (16); finally, we 
evaluate the summation over all possible quantized outputs $\mathbf{y}$ 
according to the following identity: 

$$ \frac{1}{4^N} \sum_{\mathbf{y} \in \{\pm 1 \pm j\}^N} (\mathbf{y}^H \mathbf{P} \mathbf{y})^2 = 4\, \mathrm{tr}(\mathbf{P}^2), \tag{21} $$

which can be verified for any zero-diagonal matrix $\mathbf{P}$ by expanding 
the left-hand side and identifying the nonzero terms. This 
yields the result stated by the theorem and completes the proof. 
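Note that the condition $\mathrm{E}_{\mathbf{x}}[\|\mathbf{x}\|_4^{4+\epsilon}] < \delta$ for some finite 
constants $\epsilon, \delta > 0$ is necessary, so that the remainder term of the 
expansion, having the order 

$$ \Delta I(\mathbf{x}, \mathbf{y}) = \mathrm{E}_{\mathbf{x}}\left[ o(\|\mathbf{x}\|_4^4\, \rho^2) \right], \tag{22} $$

satisfies 

$$ \lim_{\rho \to 0} \frac{\Delta I(\mathbf{x}, \mathbf{y})}{\rho^2} = 0, \tag{23} $$

since, for some $\epsilon' \in\, ]0, \epsilon]$, 

$$ \begin{aligned} \Delta I(\mathbf{x}, \mathbf{y}) &= \mathrm{E}_{\mathbf{x}}\left[ o(\|\mathbf{x}\|_4^4\, \rho^2) \right] \\ &\le \mathrm{E}_{\mathbf{x}}\left[ \left( \|\mathbf{x}\|_4^4\, \rho^2 \right)^{1 + \frac{\epsilon'}{4}} \right] \\ &= \mathrm{E}_{\mathbf{x}}\left[ \|\mathbf{x}\|_4^{4+\epsilon'} \right] \rho^{2 + \frac{\epsilon'}{2}} \\ &\le \left( \mathrm{E}_{\mathbf{x}}\left[ \|\mathbf{x}\|_4^{4+\epsilon} \right] \right)^{\frac{4+\epsilon'}{4+\epsilon}} \rho^{2 + \frac{\epsilon'}{2}} \quad \text{(Hölder's inequality)} \\ &< \delta^{\frac{4+\epsilon'}{4+\epsilon}}\, \rho^{2 + \frac{\epsilon'}{2}} = o(\rho^2). \end{aligned} $$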
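The enumeration identity used in the last step of the proof, $\frac{1}{4^N}\sum_{\mathbf{y}} (\mathbf{y}^H\mathbf{P}\mathbf{y})^2 = 4\,\mathrm{tr}(\mathbf{P}^2)$ for zero-diagonal $\mathbf{P}$, can be confirmed by brute force for small $N$. The sketch below is only a numerical sanity check; it uses a randomly drawn Hermitian zero-diagonal matrix (an arbitrary choice, so that $\mathbf{y}^H\mathbf{P}\mathbf{y}$ is real) with $N = 3$.

```python
import itertools
import numpy as np

rng = np.random.default_rng(1)
N = 3
A = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
P = A + A.conj().T            # Hermitian test matrix
P -= np.diag(np.diag(P))      # enforce a zero diagonal

symbols = [1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]
lhs = 0.0
for t in itertools.product(symbols, repeat=N):
    y = np.array(t)
    q = (y.conj() @ P @ y).real   # y^H P y is real because P is Hermitian
    lhs += q ** 2
lhs /= 4 ** N                     # average over all 4^N quantized outputs
rhs = 4 * np.trace(P @ P).real
```

Only the pairings $(i,j) \leftrightarrow (j,i)$ survive the averaging, each weighted by $\mathrm{E}[|y_i|^2 |y_j|^2] = 4$, which is where the factor 4 comes from.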



V. Application to SIMO Channels with Delay 
Spread and Receive Correlation 

In this section, we use the result from Theorem 1 to 
compute the low SNR mutual information of a frequency-selective 
single-input multiple-output (SIMO) channel with delay 
spread and receive correlation. The unquantized output of the 
considered model at time $k$ is 

$$ \mathbf{r}_k = \sqrt{\rho} \sum_{t=0}^{T-1} \mathbf{h}_k[t]\, x_{k-t} + \boldsymbol{\eta}_k \in \mathbb{C}^N, \tag{24} $$

where the noise process $\{\boldsymbol{\eta}_k\}$ is i.i.d. in time and space, while 
the $T$ fading processes $\{\mathbf{h}_k[t]\}$ in each tap $t$ are assumed to be 
independent zero-mean proper complex Gaussian processes. 
Furthermore, we assume a separable temporal-spatial correlation 
model, i.e. 

$$ \mathrm{E}\left[ \mathbf{h}_k[t]\, \mathbf{h}_{k'}[t']^H \right] = \mathbf{R} \cdot r(k - k')\, \alpha_t\, \delta[t - t']. \tag{25} $$

Here, $\mathbf{R}$ denotes the receive correlation matrix, $r(k)$ is the 
autocorrelation function of the fading process, and the scalars 
$\alpha_t$ represent the delay power profile. These correlation 
parameters can be normalized so that 

$$ \mathrm{tr}(\mathbf{R}) = N, \quad r(0) = 1 \quad \text{and} \quad \sum_{t=0}^{T-1} \alpha_t = 1. \tag{26} $$

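In other words, the energy in each receive antenna's impulse 
response equals one on average. On the other hand, we 
assume that the transmit signal $x_k$ is subject to an average 
power constraint $\mathrm{E}[\|x_k\|^2] \le 1$ and a peak power constraint 
$|x_k|^2 \le \beta$, $\forall k$, with $\beta \ge 1$. It should be pointed out that a 
peak power constraint constitutes a stronger condition than 
necessary for the validity of Theorem 1, which involves just a 
fourth-order moment constraint on the input. We now consider 
a time interval of length $n$ (a block of $n$ transmissions) and 
introduce the following notation. Collect a vector 
sequence $\mathbf{y}_k$ of length $n$ into the vector $\mathbf{y}^{(n)}$ as 

$$ \mathbf{y}^{(n)} = [\mathbf{y}_{n-1}^T, \ldots, \mathbf{y}_0^T]^T, \tag{27} $$

and form the block cyclic-shifted matrix $\mathbf{H}^{(n)} \in \mathbb{C}^{(N \cdot n) \times n}$, 

$$ \mathbf{H}^{(n)} = \begin{bmatrix} \mathbf{h}_{n-1}[0] & \cdots & \mathbf{h}_{n-1}[T-1] & \mathbf{0} & \cdots & \mathbf{0} \\ \mathbf{0} & \ddots & & \ddots & & \vdots \\ \vdots & & \ddots & & \ddots & \\ \mathbf{h}_0[1] & \cdots & \mathbf{h}_0[T-1] & \mathbf{0} & \cdots & \mathbf{h}_0[0] \end{bmatrix}. \tag{28} $$

With 

$$ \mathbf{x}^{(n)} = [x_{n-1}, \ldots, x_0]^T, \tag{29} $$

and $\mathbf{r}^{(n)}$ and $\boldsymbol{\eta}^{(n)}$ defined similarly to $\mathbf{y}^{(n)}$, the following 
unquantized space-time model may be formulated as a (loose) 
approximation of (24): 

$$ \mathbf{r}^{(n)} = \sqrt{\rho}\, \mathbf{H}^{(n)} \mathbf{x}^{(n)} + \boldsymbol{\eta}^{(n)}. \tag{30} $$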
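The stacking conventions above can be made concrete with a short sketch. It builds a block cyclic-shifted matrix under the assumption that the row block for time $k$ holds tap $\mathbf{h}_k[t]$ in the column belonging to $x_{(k-t) \bmod n}$, with both outputs and inputs ordered from index $n-1$ down to $0$; the function name and array layout are hypothetical. It then checks that the noiseless part of the stacked model agrees with the tap-delay sum evaluated with cyclic indices.

```python
import numpy as np

def block_cyclic_channel(h):
    """h[k, t] is the N-vector of tap t at time k; returns an (N*n) x n matrix."""
    n, T, N = h.shape
    Hn = np.zeros((N * n, n), dtype=complex)
    for k in range(n):
        for t in range(T):
            row = n - 1 - k               # stacking order [.._{n-1}, ..., .._0]
            col = n - 1 - ((k - t) % n)   # column of x_{(k - t) mod n}
            Hn[row * N:(row + 1) * N, col] += h[k, t]
    return Hn

rng = np.random.default_rng(2)
n, T, N = 6, 3, 2
h = rng.standard_normal((n, T, N)) + 1j * rng.standard_normal((n, T, N))
x = rng.standard_normal(n) + 1j * rng.standard_normal(n)   # x[k] = x_k

Hn = block_cyclic_channel(h)
r_stacked = Hn @ x[::-1]          # noiseless stacked output with rho = 1
# Direct evaluation of sum_t h_k[t] x_{k-t} with cyclic time indices.
r_direct = np.array([sum(h[k, t] * x[(k - t) % n] for t in range(T))
                     for k in range(n)])
```

The cyclic wrap-around only affects the first $T-1$ outputs of the block, which is why the stacked model is described as a loose approximation of the tap-delay model.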



A. Asymptotic Achievable Rate 

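Now, we elaborate on the asymptotic information rate of 
this channel setting. First we establish an upper bound and 
then we examine its achievability. 

Proposition 1: If $r(k)$ is square-summable, then the mutual 
information of the described space-time model admits, for any 
distribution fulfilling the average and peak power constraints, 
the limiting upper bound 

$$ \lim_{n \to \infty} \frac{1}{n} I(\mathbf{x}^{(n)}, \mathbf{y}^{(n)}) \le \left( \frac{2}{\pi} \rho \right)^2 U(\beta), \tag{31} $$

where 

$$ U(\beta) = \begin{cases} \beta \cdot \mu + (\beta - 1)\alpha & \text{for } \beta(\mu + \alpha) \ge 2\alpha \\ \dfrac{\beta^2 (\mu + \alpha)^2}{4\alpha} & \text{else,} \end{cases} \tag{32} $$

and 

$$ \mu = \mathrm{tr}(\mathbf{R}^2) \sum_{k=1}^{\infty} r(k)^2, \tag{33} $$

$$ \alpha = \frac{1}{2} \mathrm{tr}\left( (\mathrm{nondiag}(\mathbf{R}))^2 \right). \tag{34} $$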
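For a feel of the quantities in Proposition 1, the sketch below computes the temporal and spatial coherence parameters and $U(\beta)$ for a toy setting. It assumes the definitions $\mu = \mathrm{tr}(\mathbf{R}^2)\sum_{k \ge 1} r(k)^2$ and $\alpha = \frac{1}{2}\mathrm{tr}((\mathrm{nondiag}(\mathbf{R}))^2)$; the two-antenna correlation matrix, the geometric autocorrelation $r(k) = 0.5^k$, and the truncation of the sum are purely illustrative choices, and the function names are hypothetical.

```python
import numpy as np

def nondiag(A):
    """Set all diagonal entries of A to zero."""
    return A - np.diag(np.diag(A))

def coherence_params(R, r_tail):
    """Return (mu, alpha); r_tail holds r(1), r(2), ... (truncated)."""
    mu = np.trace(R @ R).real * float(np.sum(np.abs(r_tail) ** 2))
    alpha = 0.5 * np.trace(nondiag(R) @ nondiag(R)).real
    return mu, alpha

def U(beta, mu, alpha):
    """Piecewise expression of the upper-bound coefficient."""
    if beta * (mu + alpha) >= 2 * alpha:
        return beta * mu + (beta - 1) * alpha
    return beta ** 2 * (mu + alpha) ** 2 / (4 * alpha)

# Illustrative example: N = 2 antennas with correlation 0.5 and r(k) = 0.5^k.
R = np.array([[1.0, 0.5], [0.5, 1.0]])
r_tail = 0.5 ** np.arange(1, 200)
mu, alpha = coherence_params(R, r_tail)
bound_coeff = U(2.0, mu, alpha)
```

With these numbers $\mathrm{tr}(\mathbf{R}^2) = 2.5$ and $\sum_{k \ge 1} r(k)^2 = 1/3$, so the first branch of $U(\beta)$ applies.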



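Proof: Due to the peak power constraint, the conditions of 
Theorem 1 are satisfied; thus the second-order approximation 
(8) is valid. A tight upper bound is obtained by looking at 
the maximal value that can be achieved by the expression 
(8) up to the second order. We do this in two steps. We 
first maximize the trace expression in (8) under a prescribed 
average power per symbol $\gamma$. The maximum can, in turn, be 
upper bounded by the supremum of the first term minus the 
infimum of the second term under the prescribed average 
power and the original peak power constraint. After that, we 
perform an optimization over the parameter $\gamma$ itself. That is, 

$$ I(\mathbf{x}^{(n)}, \mathbf{y}^{(n)}) \le \frac{1}{2} \left( \frac{2}{\pi} \rho \right)^2 \max_{0 \le \gamma \le 1} \left\{ \sup_{\substack{|x_k|^2 \le \beta,\ \forall k \\ \mathrm{E}[\|\mathbf{x}^{(n)}\|^2] = \gamma n}} \mathrm{tr}\left( \mathrm{E}\left[ \left( \mathrm{nondiag}(\mathrm{E}[\mathbf{H}^{(n)} \mathbf{x}^{(n)} \mathbf{x}^{(n),H} \mathbf{H}^{(n),H} | \mathbf{x}]) \right)^2 \right] \right) - \inf_{\substack{|x_k|^2 \le \beta,\ \forall k \\ \mathrm{E}[\|\mathbf{x}^{(n)}\|^2] = \gamma n}} \mathrm{tr}\left( \left( \mathrm{nondiag}(\mathrm{E}[\mathbf{H}^{(n)} \mathrm{E}[\mathbf{x}^{(n)} \mathbf{x}^{(n),H}] \mathbf{H}^{(n),H}]) \right)^2 \right) \right\} + o(\rho^2). \tag{35} $$

It turns out that the supremum of the first trace term is achieved 
when all inputs $x_k$ take, simultaneously during the considered 
time interval, either the value zero or the peak value (with $|x_k|^2 = \beta$), with 
a duty cycle of $\gamma \beta^{-1}$. On the other hand, the infimum of 
the second trace expression is obtained, under the prescribed 
average power, when $\mathrm{E}[\mathbf{x}\mathbf{x}^H] = \gamma \mathbf{I}_n$. Calculation shows that 

$$ \frac{1}{n} I(\mathbf{x}^{(n)}, \mathbf{y}^{(n)}) \le \max_{0 \le \gamma \le 1} \left( \frac{2}{\pi} \rho \right)^2 \left[ \gamma \beta \cdot \mathrm{tr}(\mathbf{R}^2) \sum_{k=1}^{n-1} \left( 1 - \frac{k}{n} \right) r(k)^2 + \gamma (\beta - \gamma) \cdot \frac{1}{2} \mathrm{tr}\left( (\mathrm{nondiag}(\mathbf{R}))^2 \right) r(0)^2 \right] + o(\rho^2). \tag{36} $$

Now, writing $\mu_n = \mathrm{tr}(\mathbf{R}^2) \sum_{k=1}^{n-1} (1 - \frac{k}{n})\, r(k)^2$ for the finite-block temporal term, the maximization over $\gamma$ delivers 

$$ \gamma_{\mathrm{opt}} = \min\left\{ 1, \frac{\beta (\mu_n + \alpha)}{2\alpha} \right\}. \tag{37} $$

Thus, taking the limit $n \to \infty$ yields $\mu_n \to \mu$ as in (33), and 
by the normalizations in (26), we end up with the result of the 
proposition.$^2$ ■ 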
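The maximization over the average power $\gamma$ in the proof is a one-dimensional concave quadratic problem, so the closed form can be cross-checked against a grid search. The sketch below assumes the objective $\gamma\beta\mu + \gamma(\beta - \gamma)\alpha$ on $0 \le \gamma \le 1$ and the maximizer $\min\{1, \beta(\mu + \alpha)/(2\alpha)\}$; the parameter values are arbitrary and chosen so that the interior solution applies.

```python
import numpy as np

def gamma_opt(beta, mu, alpha):
    """Closed-form maximizer of the objective over 0 <= gamma <= 1."""
    if alpha == 0:
        return 1.0   # objective is linear and increasing in gamma
    return min(1.0, beta * (mu + alpha) / (2 * alpha))

def objective(gamma, beta, mu, alpha):
    """Bracketed term of the bound as a function of the average power gamma."""
    return gamma * beta * mu + gamma * (beta - gamma) * alpha

beta, mu, alpha = 1.2, 0.1, 1.0            # arbitrary illustrative values
grid = np.linspace(0.0, 1.0, 100001)
g_num = grid[np.argmax(objective(grid, beta, mu, alpha))]
g_cf = gamma_opt(beta, mu, alpha)
u_val = objective(g_cf, beta, mu, alpha)   # equals beta^2 (mu+alpha)^2 / (4 alpha) here
```

When $\beta(\mu + \alpha) \ge 2\alpha$ the maximizer saturates at $\gamma = 1$, which corresponds to using the full average power.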
We next turn to the question of whether the upper bound 
suggested by Proposition 1 is achievable. A closer examination 
of (35) demonstrates that the upper bound could be achieved 
if the input distribution satisfied, for any time instants $k$ 
and $l$ within the block of length $n$, 

$$ x_k x_l^* = a\, e^{j \Omega (k - l)}, \tag{38} $$

with some $\Omega$ and $a \in \{0, \beta\}$, while having $\mathrm{E}[\mathbf{x}^{(n)} \mathbf{x}^{(n),H}] = 
\gamma \mathbf{I}_n$ as already mentioned in the proof of Proposition 1. Clearly, 
an on-off frequency-shift keying (OOFSK) modulation for the 
input, as follows, can fulfill these conditions: 

$$ x_k = Z \cdot e^{j k \Omega}, \quad k \in \{1, \ldots, n\}, \tag{39} $$

where $Z$ takes the value $\sqrt{\beta}$ with probability $\gamma_{\mathrm{opt}} \beta^{-1}$ and zero 
with probability $(1 - \gamma_{\mathrm{opt}} \beta^{-1})$, and $\Omega$ is uniformly distributed 
over the set $\{0, \frac{2\pi}{n}, \ldots, \frac{2\pi(n-1)}{n}\}$. This is exactly similar to the 
results of the unquantized case [6]. 

$^2$ Observe that $\mu$ and $\alpha$ are indicators for the temporal and spatial coherence, 
respectively. 
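The second-moment property required of the OOFSK input can be verified exactly by enumerating the frequency set. The sketch below assumes that $\Omega$ is uniform over the $n$ frequencies $2\pi m/n$, $m = 0, \ldots, n-1$, and that $Z = \sqrt{\beta}$ with probability $\gamma/\beta$ and zero otherwise; the function name and the example values are hypothetical.

```python
import numpy as np

def oofsk_second_moment(n, beta, gamma):
    """Exact E[x x^H] for x_k = Z exp(j k Omega), k = 1..n."""
    k = np.arange(1, n + 1)
    p_on = gamma / beta                      # probability that Z = sqrt(beta)
    C = np.zeros((n, n), dtype=complex)
    for m in range(n):                       # Omega = 2*pi*m/n, each w.p. 1/n
        x = np.sqrt(beta) * np.exp(1j * k * 2 * np.pi * m / n)
        C += p_on * np.outer(x, x.conj()) / n
    return C                                 # the "off" state contributes zero

n, beta, gamma = 8, 2.0, 0.7
C = oofsk_second_moment(n, beta, gamma)
```

Averaging over the frequency removes all off-diagonal correlation of the input, while within each realization the products $x_k x_l^*$ keep the constant-modulus structure $a\,e^{j\Omega(k-l)}$.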



B. Discussions 

We observe from Proposition 1 that spreading the power over 
different taps does not affect the low SNR rate, while receiver 
correlation is beneficial due to two effects. First, the mutual 
information increases with $\alpha$ defined in (34), which is related 
to the norm of the off-diagonal elements of $\mathbf{R}$. Second, under 
the normalization $\mathrm{tr}(\mathbf{R}) = N$, the Frobenius norm $\mathrm{tr}(\mathbf{R}^2)$ 
increases with more correlation among the receive antennas, 
and consequently, higher rates at low SNR can be achieved due 
to relation (33). Besides, we note that the achievability of the 
upper bound stated in Proposition 1, as discussed previously, 
is obtained at the cost of burstiness in frequency, which may 
not agree with some specifications imposed on many systems. 
Therefore, it is interesting to look at the asymptotic rate of i.i.d. 
input symbols drawn from the set $\{-\sqrt{\beta}, 0, \sqrt{\beta}\}$. In that case, 
it turns out by (8) that 

$$ \lim_{n \to \infty} \frac{1}{n} I(\mathbf{x}^{(n)}, \mathbf{y}^{(n)}) \approx \left( \frac{2}{\pi} \rho \right)^2 \max_{0 \le \gamma \le 1} \left[ \gamma^2 \mu \sum_{t=0}^{T-1} \alpha_t^2 + \gamma (\beta - \gamma) \alpha \right]. \tag{40} $$

Here, we observe that, contrary to the FSK-like scheme, the 
mutual information with i.i.d. input is negatively affected by 
the delay spread, since $\sum_{t=0}^{T-1} \alpha_t^2 \le 1$ by the normalization (26). 
Nevertheless, for the case $\alpha \gg \mu$, i.e. low temporal 
correlation (high Doppler spread), the gap to the upper bound 
in Proposition 1 vanishes, as demonstrated in Fig. 3. 
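The comparison between the i.i.d. rate and the upper bound can be reproduced qualitatively with a few lines. The sketch below assumes the bracketed objectives $\gamma\beta\mu + \gamma(\beta-\gamma)\alpha$ for the upper bound and $\gamma^2\mu\sum_t \alpha_t^2 + \gamma(\beta-\gamma)\alpha$ for i.i.d. inputs, both maximized over $0 \le \gamma \le 1$ by grid search; the common prefactor cancels in the ratio. The values $\beta = 2$ and a uniform delay profile follow the figure caption, while $T = 4$, $\mu = 1$ and the sampled values of $\alpha$ are illustrative assumptions.

```python
import numpy as np

GAMMA_GRID = np.linspace(0.0, 1.0, 20001)

def upper_bound_term(beta, mu, alpha):
    """max over gamma of gamma*beta*mu + gamma*(beta - gamma)*alpha."""
    g = GAMMA_GRID
    return np.max(g * beta * mu + g * (beta - g) * alpha)

def iid_term(beta, mu, alpha, taps):
    """max over gamma of gamma^2*mu*sum(alpha_t^2) + gamma*(beta - gamma)*alpha."""
    s = float(np.sum(np.asarray(taps) ** 2))   # at most 1 under the normalization
    g = GAMMA_GRID
    return np.max(g ** 2 * mu * s + g * (beta - g) * alpha)

beta, T = 2.0, 4
taps = np.full(T, 1.0 / T)                     # uniform delay power profile
alphas = (0.1, 1.0, 10.0, 1000.0)              # spatial coherence; mu fixed to 1
ratios = [iid_term(beta, 1.0, a, taps) / upper_bound_term(beta, 1.0, a)
          for a in alphas]
```

The ratio stays below one and approaches one as the spatial-to-temporal coherence ratio grows, in line with the trend described for Fig. 3.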




Fig. 3. Ratio of i.i.d. rate (40) and upper bound (31) vs. the spatial-to-temporal coherence ratio $\alpha/\mu$ for $\beta = 2$ and uniform delay spread, i.e. $\alpha_t = 1/T$. 



VI. Conclusion 

We derived an expression for the second-order expansion of 
the mutual information for general MIMO channels with 
one-bit ADCs at low SNR and for a general input distribution. Based 
on this, we showed that the power penalty due to the 1-bit 
quantization is approximately equal to $\pi/2$ (1.96 dB) at low 
SNR in practical channel settings. This confirms that 
low-resolution (or maybe 1-bit) sampling in the low SNR regime 
(or equivalently in the wideband limit) performs adequately, 
while reducing power consumption. Besides, it turns out that 
both spatial and temporal correlations are extremely beneficial 
at low SNR (even more than in the unquantized case). We 
studied the special case of multiple receive antennas with 
delay spread and correlation both in time and space in detail 
and obtained the asymptotic achievable rate under average and 
peak power constraints. 

$^3$ Note that such observations hold also in the unquantized case [6]. 